Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 141362 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 15.5 MiB |
| Average record size in memory | 115.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Boolean | 3 |
studio has constant value "False" | Constant |
build_year is highly overall correlated with floors_total | High correlation |
building_type_int is highly overall correlated with ceiling_height | High correlation |
ceiling_height is highly overall correlated with building_type_int | High correlation |
floor is highly overall correlated with floors_total | High correlation |
floors_total is highly overall correlated with build_year and 1 other fields | High correlation |
living_area is highly overall correlated with rooms and 1 other fields | High correlation |
price is highly overall correlated with rooms and 1 other fields | High correlation |
rooms is highly overall correlated with living_area and 2 other fields | High correlation |
total_area is highly overall correlated with living_area and 2 other fields | High correlation |
has_elevator is highly imbalanced (52.3%) | Imbalance |
is_apartment is highly imbalanced (92.1%) | Imbalance |
price is highly skewed (γ1 = 88.96515049) | Skewed |
id is uniformly distributed | Uniform |
id has unique values | Unique |
building_type_int has 1927 (1.4%) zeros | Zeros |
kitchen_area has 11701 (8.3%) zeros | Zeros |
living_area has 18588 (13.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-07-10 07:23:31.928132 |
|---|---|
| Analysis finished | 2024-07-10 07:24:16.653130 |
| Duration | 44.72 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 141362 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70681.5 |
| Minimum | 1 |
|---|---|
| Maximum | 141362 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7069.05 |
| Q1 | 35341.25 |
| median | 70681.5 |
| Q3 | 106021.75 |
| 95-th percentile | 134293.95 |
| Maximum | 141362 |
| Range | 141361 |
| Interquartile range (IQR) | 70680.5 |
Descriptive statistics
| Standard deviation | 40807.839 |
|---|---|
| Coefficient of variation (CV) | 0.57734823 |
| Kurtosis | -1.2 |
| Mean | 70681.5 |
| Median Absolute Deviation (MAD) | 35340.5 |
| Skewness | 0 |
| Sum | 9.9916782 × 109 |
| Variance | 1.6652797 × 109 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 141362 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Other values (141352) | 141352 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 141362 | 1 | |
| 141361 | 1 | |
| 141360 | 1 | |
| 141359 | 1 | |
| 141358 | 1 | |
| 141357 | 1 | |
| 141356 | 1 | |
| 141355 | 1 | |
| 141354 | 1 | |
| 141353 | 1 |
build_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 118 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1986.6 |
| Minimum | 1901 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1901 |
|---|---|
| 5-th percentile | 1957 |
| Q1 | 1969 |
| median | 1985 |
| Q3 | 2007 |
| 95-th percentile | 2017 |
| Maximum | 2023 |
| Range | 122 |
| Interquartile range (IQR) | 38 |
Descriptive statistics
| Standard deviation | 22.136409 |
|---|---|
| Coefficient of variation (CV) | 0.011142861 |
| Kurtosis | -0.1394151 |
| Mean | 1986.6 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.38696514 |
| Sum | 2.8082976 × 108 |
| Variance | 490.02061 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 4461 | 3.2% |
| 2018 | 4386 | 3.1% |
| 1968 | 3502 | 2.5% |
| 2015 | 3466 | 2.5% |
| 1969 | 3322 | 2.3% |
| 1972 | 3256 | 2.3% |
| 1970 | 3210 | 2.3% |
| 1971 | 3197 | 2.3% |
| 1967 | 3191 | 2.3% |
| 2006 | 2874 | 2.0% |
| Other values (108) | 106497 |
| Value | Count | Frequency (%) |
| 1901 | 10 | < 0.1% |
| 1902 | 75 | |
| 1903 | 28 | < 0.1% |
| 1904 | 32 | < 0.1% |
| 1905 | 74 | |
| 1906 | 33 | < 0.1% |
| 1907 | 42 | < 0.1% |
| 1908 | 27 | < 0.1% |
| 1909 | 11 | < 0.1% |
| 1910 | 130 |
| Value | Count | Frequency (%) |
| 2023 | 3 | < 0.1% |
| 2022 | 236 | 0.2% |
| 2021 | 131 | 0.1% |
| 2020 | 587 | 0.4% |
| 2019 | 1257 | 0.9% |
| 2018 | 4386 | |
| 2017 | 4461 | |
| 2016 | 2540 | |
| 2015 | 3466 | |
| 2014 | 2680 |
building_type_int
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.232941 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1927 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4594606 |
|---|---|
| Coefficient of variation (CV) | 0.45143434 |
| Kurtosis | -0.67749025 |
| Mean | 3.232941 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.25416628 |
| Sum | 457015 |
| Variance | 2.1300252 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 78696 | |
| 2 | 25265 | 17.9% |
| 1 | 23164 | 16.4% |
| 6 | 10533 | 7.5% |
| 0 | 1927 | 1.4% |
| 3 | 1773 | 1.3% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1927 | 1.4% |
| 1 | 23164 | 16.4% |
| 2 | 25265 | 17.9% |
| 3 | 1773 | 1.3% |
| 4 | 78696 | |
| 5 | 4 | < 0.1% |
| 6 | 10533 | 7.5% |
| Value | Count | Frequency (%) |
| 6 | 10533 | 7.5% |
| 5 | 4 | < 0.1% |
| 4 | 78696 | |
| 3 | 1773 | 1.3% |
| 2 | 25265 | 17.9% |
| 1 | 23164 | 16.4% |
| 0 | 1927 | 1.4% |
latitude
Real number (ℝ)
| Distinct | 15720 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.730059 |
| Minimum | 55.21146 |
|---|---|
| Maximum | 56.011032 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 55.21146 |
|---|---|
| 5-th percentile | 55.567215 |
| Q1 | 55.653858 |
| median | 55.724686 |
| Q3 | 55.807323 |
| 95-th percentile | 55.883572 |
| Maximum | 56.011032 |
| Range | 0.79957199 |
| Interquartile range (IQR) | 0.15346527 |
Descriptive statistics
| Standard deviation | 0.10261107 |
|---|---|
| Coefficient of variation (CV) | 0.0018412159 |
| Kurtosis | -0.35001837 |
| Mean | 55.730059 |
| Median Absolute Deviation (MAD) | 0.076637268 |
| Skewness | -0.0075277291 |
| Sum | 7878112.6 |
| Variance | 0.010529032 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55.77080536 | 565 | 0.4% |
| 55.78516769 | 381 | 0.3% |
| 55.83548737 | 285 | 0.2% |
| 55.55776596 | 257 | 0.2% |
| 55.74351883 | 251 | 0.2% |
| 55.83442688 | 235 | 0.2% |
| 55.77090073 | 220 | 0.2% |
| 55.78421021 | 210 | 0.1% |
| 55.69994354 | 192 | 0.1% |
| 55.59153748 | 167 | 0.1% |
| Other values (15710) | 138599 |
| Value | Count | Frequency (%) |
| 55.21146011 | 5 | |
| 55.21229935 | 3 | < 0.1% |
| 55.21334457 | 2 | < 0.1% |
| 55.21493912 | 10 | |
| 55.21569824 | 3 | < 0.1% |
| 55.30972672 | 3 | < 0.1% |
| 55.31040192 | 2 | < 0.1% |
| 55.3139801 | 1 | < 0.1% |
| 55.31425476 | 2 | < 0.1% |
| 55.31481552 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 56.0110321 | 14 | |
| 56.00934601 | 18 | |
| 56.00914001 | 5 | < 0.1% |
| 56.00882339 | 2 | < 0.1% |
| 56.00848389 | 1 | < 0.1% |
| 56.00812149 | 4 | < 0.1% |
| 56.00805664 | 7 | < 0.1% |
| 56.00751877 | 4 | < 0.1% |
| 56.00745773 | 4 | < 0.1% |
| 56.00717163 | 2 | < 0.1% |
longitude
Real number (ℝ)
| Distinct | 15271 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.589235 |
| Minimum | 36.864372 |
|---|---|
| Maximum | 37.946411 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 36.864372 |
|---|---|
| 5-th percentile | 37.354996 |
| Q1 | 37.491764 |
| median | 37.581146 |
| Q3 | 37.691055 |
| 95-th percentile | 37.828963 |
| Maximum | 37.946411 |
| Range | 1.0820389 |
| Interquartile range (IQR) | 0.19929123 |
Descriptive statistics
| Standard deviation | 0.15012178 |
|---|---|
| Coefficient of variation (CV) | 0.0039937439 |
| Kurtosis | 0.31447965 |
| Mean | 37.589235 |
| Median Absolute Deviation (MAD) | 0.094726562 |
| Skewness | -0.1160192 |
| Sum | 5313689.4 |
| Variance | 0.022536548 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.5642128 | 581 | 0.4% |
| 37.56404114 | 407 | 0.3% |
| 37.65834808 | 316 | 0.2% |
| 37.55502319 | 257 | 0.2% |
| 37.42210007 | 251 | 0.2% |
| 37.65960312 | 246 | 0.2% |
| 37.56266785 | 227 | 0.2% |
| 37.37809753 | 220 | 0.2% |
| 37.63718414 | 188 | 0.1% |
| 37.45571518 | 168 | 0.1% |
| Other values (15261) | 138501 |
| Value | Count | Frequency (%) |
| 36.86437225 | 13 | |
| 36.86503601 | 20 | |
| 36.86519623 | 3 | < 0.1% |
| 36.86552811 | 3 | < 0.1% |
| 36.86582565 | 16 | |
| 36.86593246 | 1 | < 0.1% |
| 36.8693924 | 5 | < 0.1% |
| 36.87020111 | 4 | < 0.1% |
| 36.87042618 | 2 | < 0.1% |
| 36.87133408 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 37.94641113 | 27 | < 0.1% |
| 37.94482803 | 19 | < 0.1% |
| 37.9413147 | 43 | |
| 37.94092178 | 27 | < 0.1% |
| 37.94085693 | 105 | |
| 37.94051743 | 7 | < 0.1% |
| 37.94021988 | 10 | < 0.1% |
| 37.93956375 | 82 | |
| 37.93946457 | 26 | < 0.1% |
| 37.9389267 | 15 | < 0.1% |
ceiling_height
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7536498 |
| Minimum | 2 |
|---|---|
| Maximum | 27 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.48 |
| Q1 | 2.6400001 |
| median | 2.6400001 |
| Q3 | 2.8 |
| 95-th percentile | 3.0999999 |
| Maximum | 27 |
| Range | 25 |
| Interquartile range (IQR) | 0.15999985 |
Descriptive statistics
| Standard deviation | 0.22327539 |
|---|---|
| Coefficient of variation (CV) | 0.081083437 |
| Kurtosis | 1009.3248 |
| Mean | 2.7536498 |
| Median Absolute Deviation (MAD) | 0.059999943 |
| Skewness | 11.564608 |
| Sum | 389261.44 |
| Variance | 0.049851898 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.640000105 | 58512 | |
| 3 | 22933 | 16.2% |
| 2.700000048 | 14993 | 10.6% |
| 2.480000019 | 7904 | 5.6% |
| 2.799999952 | 7425 | 5.3% |
| 2.74000001 | 6724 | 4.8% |
| 3.200000048 | 3819 | 2.7% |
| 2.5 | 3813 | 2.7% |
| 3.099999905 | 2906 | 2.1% |
| 2.75 | 2118 | 1.5% |
| Other values (67) | 10215 | 7.2% |
| Value | Count | Frequency (%) |
| 2 | 12 | < 0.1% |
| 2.25 | 1 | < 0.1% |
| 2.299999952 | 5 | < 0.1% |
| 2.400000095 | 31 | < 0.1% |
| 2.450000048 | 17 | < 0.1% |
| 2.480000019 | 7904 | |
| 2.5 | 3813 | |
| 2.50999999 | 2 | < 0.1% |
| 2.529999971 | 9 | < 0.1% |
| 2.539999962 | 104 | 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 8 | 6 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 17 | < 0.1% |
| 5.199999809 | 1 | < 0.1% |
| 5 | 6 | < 0.1% |
| 4.599999905 | 4 | < 0.1% |
| 4.5 | 14 | < 0.1% |
| 4.190000057 | 40 | |
| 4.150000095 | 47 |
flats_count
Real number (ℝ)
| Distinct | 706 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 251.99323 |
| Minimum | 1 |
|---|---|
| Maximum | 4455 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 58 |
| Q1 | 111 |
| median | 200 |
| Q3 | 324 |
| 95-th percentile | 624 |
| Maximum | 4455 |
| Range | 4454 |
| Interquartile range (IQR) | 213 |
Descriptive statistics
| Standard deviation | 207.33617 |
|---|---|
| Coefficient of variation (CV) | 0.82278468 |
| Kurtosis | 12.928546 |
| Mean | 251.99323 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 2.702838 |
| Sum | 35622267 |
| Variance | 42988.287 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 4189 | 3.0% |
| 144 | 2820 | 2.0% |
| 84 | 2638 | 1.9% |
| 72 | 2335 | 1.7% |
| 215 | 2281 | 1.6% |
| 287 | 2179 | 1.5% |
| 60 | 1967 | 1.4% |
| 192 | 1791 | 1.3% |
| 98 | 1663 | 1.2% |
| 143 | 1600 | 1.1% |
| Other values (696) | 117899 |
| Value | Count | Frequency (%) |
| 1 | 199 | |
| 2 | 71 | 0.1% |
| 3 | 20 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 47 | < 0.1% |
| 6 | 40 | < 0.1% |
| 7 | 19 | < 0.1% |
| 8 | 62 | < 0.1% |
| 9 | 41 | < 0.1% |
| 10 | 151 |
| Value | Count | Frequency (%) |
| 4455 | 1 | < 0.1% |
| 1630 | 565 | |
| 1623 | 257 | |
| 1586 | 5 | < 0.1% |
| 1198 | 116 | 0.1% |
| 1189 | 11 | < 0.1% |
| 1183 | 34 | < 0.1% |
| 1149 | 19 | < 0.1% |
| 1133 | 58 | < 0.1% |
| 1114 | 84 | 0.1% |
floors_total
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.107554 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 9 |
| median | 14 |
| Q3 | 17 |
| 95-th percentile | 25 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.8980454 |
|---|---|
| Coefficient of variation (CV) | 0.48896113 |
| Kurtosis | 5.5000767 |
| Mean | 14.107554 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.5626381 |
| Sum | 1994272 |
| Variance | 47.58303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 25042 | |
| 17 | 23221 | |
| 12 | 17177 | |
| 5 | 12702 | |
| 14 | 11566 | |
| 16 | 10260 | |
| 22 | 6147 | 4.3% |
| 25 | 4476 | 3.2% |
| 8 | 4005 | 2.8% |
| 24 | 2631 | 1.9% |
| Other values (54) | 24135 |
| Value | Count | Frequency (%) |
| 1 | 11 | < 0.1% |
| 2 | 54 | < 0.1% |
| 3 | 485 | 0.3% |
| 4 | 930 | 0.7% |
| 5 | 12702 | |
| 6 | 1727 | 1.2% |
| 7 | 2062 | 1.5% |
| 8 | 4005 | 2.8% |
| 9 | 25042 | |
| 10 | 2502 | 1.8% |
| Value | Count | Frequency (%) |
| 99 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 59 | 4 | < 0.1% |
| 58 | 77 | |
| 57 | 39 | < 0.1% |
| 56 | 184 | |
| 55 | 1 | < 0.1% |
has_elevator
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.2 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 126856 | |
| False | 14506 | 10.3% |
floor
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 56 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4673462 |
| Minimum | 1 |
|---|---|
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 10 |
| 95-th percentile | 18 |
| Maximum | 56 |
| Range | 55 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.7171437 |
|---|---|
| Coefficient of variation (CV) | 0.76561921 |
| Kurtosis | 5.2689924 |
| Mean | 7.4673462 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.6832674 |
| Sum | 1055599 |
| Variance | 32.685732 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 14701 | |
| 3 | 13668 | 9.7% |
| 5 | 12700 | 9.0% |
| 4 | 12510 | 8.8% |
| 1 | 11435 | 8.1% |
| 6 | 9609 | 6.8% |
| 7 | 9319 | 6.6% |
| 8 | 8772 | 6.2% |
| 9 | 8492 | 6.0% |
| 10 | 5645 | 4.0% |
| Other values (46) | 34511 |
| Value | Count | Frequency (%) |
| 1 | 11435 | |
| 2 | 14701 | |
| 3 | 13668 | |
| 4 | 12510 | |
| 5 | 12700 | |
| 6 | 9609 | |
| 7 | 9319 | |
| 8 | 8772 | |
| 9 | 8492 | |
| 10 | 5645 | 4.0% |
| Value | Count | Frequency (%) |
| 56 | 6 | |
| 55 | 6 | |
| 54 | 2 | < 0.1% |
| 53 | 11 | |
| 52 | 8 | |
| 51 | 10 | |
| 50 | 4 | < 0.1% |
| 49 | 7 | |
| 48 | 8 | |
| 47 | 3 | < 0.1% |
kitchen_area
Real number (ℝ)
ZEROS 
| Distinct | 1036 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.001579 |
| Minimum | 0 |
|---|---|
| Maximum | 203 |
| Zeros | 11701 |
| Zeros (%) | 8.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6.0999999 |
| median | 8.8000002 |
| Q3 | 10.2 |
| 95-th percentile | 17.299999 |
| Maximum | 203 |
| Range | 203 |
| Interquartile range (IQR) | 4.0999999 |
Descriptive statistics
| Standard deviation | 5.2640755 |
|---|---|
| Coefficient of variation (CV) | 0.58479468 |
| Kurtosis | 31.917312 |
| Mean | 9.001579 |
| Median Absolute Deviation (MAD) | 2.1999998 |
| Skewness | 2.8364444 |
| Sum | 1272481.2 |
| Variance | 27.710491 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 15584 | 11.0% |
| 10 | 13323 | 9.4% |
| 0 | 11701 | 8.3% |
| 9 | 9594 | 6.8% |
| 8 | 7550 | 5.3% |
| 7 | 5654 | 4.0% |
| 12 | 4591 | 3.2% |
| 11 | 3566 | 2.5% |
| 8.5 | 3225 | 2.3% |
| 15 | 2301 | 1.6% |
| Other values (1026) | 64273 |
| Value | Count | Frequency (%) |
| 0 | 11701 | |
| 1.5 | 1 | < 0.1% |
| 1.700000048 | 2 | < 0.1% |
| 1.799999952 | 1 | < 0.1% |
| 2 | 41 | < 0.1% |
| 2.099999905 | 1 | < 0.1% |
| 2.25 | 3 | < 0.1% |
| 2.299999952 | 2 | < 0.1% |
| 2.400000095 | 3 | < 0.1% |
| 2.5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 203 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 100 | 3 | |
| 90 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 83.40000153 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 80 | 6 | |
| 78 | 1 | < 0.1% |
living_area
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2345 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.056948 |
| Minimum | 0 |
|---|---|
| Maximum | 700 |
| Zeros | 18588 |
| Zeros (%) | 13.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 19 |
| median | 29.4 |
| Q3 | 41.400002 |
| 95-th percentile | 67 |
| Maximum | 700 |
| Range | 700 |
| Interquartile range (IQR) | 22.400002 |
Descriptive statistics
| Standard deviation | 23.96864 |
|---|---|
| Coefficient of variation (CV) | 0.77176416 |
| Kurtosis | 29.084183 |
| Mean | 31.056948 |
| Median Absolute Deviation (MAD) | 10.5 |
| Skewness | 3.2220584 |
| Sum | 4390272.3 |
| Variance | 574.49569 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 18588 | 13.1% |
| 19 | 5974 | 4.2% |
| 20 | 4899 | 3.5% |
| 30 | 3820 | 2.7% |
| 18 | 3622 | 2.6% |
| 32 | 3007 | 2.1% |
| 28 | 2318 | 1.6% |
| 31 | 2165 | 1.5% |
| 45 | 2147 | 1.5% |
| 34 | 1996 | 1.4% |
| Other values (2335) | 92826 |
| Value | Count | Frequency (%) |
| 0 | 18588 | |
| 2 | 2 | < 0.1% |
| 3 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 5.5 | 2 | < 0.1% |
| 5.599999905 | 1 | < 0.1% |
| 5.800000191 | 1 | < 0.1% |
| 6 | 16 | < 0.1% |
| 6.400000095 | 2 | < 0.1% |
| 6.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 700 | 1 | < 0.1% |
| 500 | 1 | < 0.1% |
| 490 | 1 | < 0.1% |
| 433 | 1 | < 0.1% |
| 430 | 2 | |
| 426 | 2 | |
| 403 | 2 | |
| 394 | 3 | |
| 382.7999878 | 1 | < 0.1% |
| 380 | 1 | < 0.1% |
rooms
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1294761 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.99433956 |
|---|---|
| Coefficient of variation (CV) | 0.46694094 |
| Kurtosis | 3.719362 |
| Mean | 2.1294761 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0604964 |
| Sum | 301027 |
| Variance | 0.98871117 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 52676 | |
| 1 | 42029 | |
| 3 | 36930 | |
| 4 | 7091 | 5.0% |
| 5 | 1802 | 1.3% |
| 6 | 556 | 0.4% |
| 7 | 179 | 0.1% |
| 8 | 55 | < 0.1% |
| 10 | 19 | < 0.1% |
| 9 | 18 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 42029 | |
| 2 | 52676 | |
| 3 | 36930 | |
| 4 | 7091 | 5.0% |
| 5 | 1802 | 1.3% |
| 6 | 556 | 0.4% |
| 7 | 179 | 0.1% |
| 8 | 55 | < 0.1% |
| 9 | 18 | < 0.1% |
| 10 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 12 | 4 | < 0.1% |
| 10 | 19 | < 0.1% |
| 9 | 18 | < 0.1% |
| 8 | 55 | < 0.1% |
| 7 | 179 | 0.1% |
| 6 | 556 | 0.4% |
| 5 | 1802 |
is_apartment
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.2 KiB |
| False | |
|---|---|
| True | 1372 |
| Value | Count | Frequency (%) |
| False | 139990 | |
| True | 1372 | 1.0% |
studio
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 141362 |
total_area
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3358 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.374644 |
| Minimum | 11 |
|---|---|
| Maximum | 960.29999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 32.5 |
| Q1 | 39.299999 |
| median | 53 |
| Q3 | 72 |
| 95-th percentile | 127 |
| Maximum | 960.29999 |
| Range | 949.29999 |
| Interquartile range (IQR) | 32.700001 |
Descriptive statistics
| Standard deviation | 40.295864 |
|---|---|
| Coefficient of variation (CV) | 0.64602956 |
| Kurtosis | 50.131605 |
| Mean | 62.374644 |
| Median Absolute Deviation (MAD) | 14.200001 |
| Skewness | 5.1673471 |
| Sum | 8817404.4 |
| Variance | 1623.7566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38 | 3580 | 2.5% |
| 45 | 2807 | 2.0% |
| 39 | 2463 | 1.7% |
| 40 | 2188 | 1.5% |
| 60 | 2108 | 1.5% |
| 52 | 2078 | 1.5% |
| 54 | 1992 | 1.4% |
| 33 | 1935 | 1.4% |
| 35 | 1818 | 1.3% |
| 44 | 1492 | 1.1% |
| Other values (3348) | 118901 |
| Value | Count | Frequency (%) |
| 11 | 2 | |
| 11.30000019 | 1 | < 0.1% |
| 11.5 | 2 | |
| 11.69999981 | 2 | |
| 12 | 3 | |
| 12.10000038 | 2 | |
| 12.22999954 | 1 | < 0.1% |
| 12.30000019 | 2 | |
| 12.55000019 | 2 | |
| 12.60000038 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 960.2999878 | 2 | |
| 925 | 2 | |
| 920 | 1 | |
| 901 | 1 | |
| 872.5999756 | 1 | |
| 800 | 1 | |
| 764.1 | 1 | |
| 761 | 1 | |
| 760 | 1 | |
| 757.7999878 | 2 |
price
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 8384 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19441620 |
| Minimum | 11 |
|---|---|
| Maximum | 9.8737377 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 6500000 |
| Q1 | 8900000 |
| median | 11850000 |
| Q3 | 16950000 |
| 95-th percentile | 49500000 |
| Maximum | 9.8737377 × 109 |
| Range | 9.8737377 × 109 |
| Interquartile range (IQR) | 8050000 |
Descriptive statistics
| Standard deviation | 66269544 |
|---|---|
| Coefficient of variation (CV) | 3.4086431 |
| Kurtosis | 11579.848 |
| Mean | 19441620 |
| Median Absolute Deviation (MAD) | 3450000 |
| Skewness | 88.96515 |
| Sum | 2.7483063 × 1012 |
| Variance | 4.3916525 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10500000 | 2287 | 1.6% |
| 9500000 | 2057 | 1.5% |
| 12500000 | 1982 | 1.4% |
| 8500000 | 1922 | 1.4% |
| 11500000 | 1855 | 1.3% |
| 11000000 | 1702 | 1.2% |
| 12000000 | 1687 | 1.2% |
| 13500000 | 1558 | 1.1% |
| 9000000 | 1493 | 1.1% |
| 10000000 | 1411 | 1.0% |
| Other values (8374) | 123408 |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 1500 | 1 | < 0.1% |
| 2000 | 3 | |
| 2300 | 1 | < 0.1% |
| 2400 | 1 | < 0.1% |
| 2500 | 3 | |
| 2600 | 1 | < 0.1% |
| 2700 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9873737728 | 1 | |
| 9799999488 | 1 | |
| 9200000000 | 1 | |
| 8147034112 | 1 | |
| 4686425088 | 1 | |
| 4447152128 | 1 | |
| 4048056064 | 1 | |
| 3525904640 | 1 | |
| 2551801088 | 2 | |
| 2500000000 | 1 |
| build_year | building_type_int | ceiling_height | flats_count | floor | floors_total | has_elevator | id | is_apartment | kitchen_area | latitude | living_area | longitude | price | rooms | total_area | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| build_year | 1.000 | -0.019 | 0.361 | 0.455 | 0.395 | 0.732 | 0.491 | 0.021 | 0.138 | 0.435 | -0.201 | -0.006 | -0.217 | 0.186 | 0.008 | 0.270 |
| building_type_int | -0.019 | 1.000 | -0.569 | 0.092 | 0.050 | 0.106 | 0.343 | -0.029 | 0.127 | -0.088 | -0.042 | -0.118 | 0.117 | -0.369 | -0.189 | -0.295 |
| ceiling_height | 0.361 | -0.569 | 1.000 | 0.049 | 0.128 | 0.217 | 0.008 | 0.039 | 0.000 | 0.315 | -0.026 | 0.147 | -0.120 | 0.492 | 0.219 | 0.448 |
| flats_count | 0.455 | 0.092 | 0.049 | 1.000 | 0.265 | 0.483 | 0.085 | -0.001 | 0.236 | 0.171 | -0.158 | 0.006 | -0.087 | 0.001 | -0.037 | 0.072 |
| floor | 0.395 | 0.050 | 0.128 | 0.265 | 1.000 | 0.518 | 0.270 | 0.010 | 0.150 | 0.240 | -0.049 | 0.033 | -0.086 | 0.148 | 0.017 | 0.146 |
| floors_total | 0.732 | 0.106 | 0.217 | 0.483 | 0.518 | 1.000 | 0.389 | 0.014 | 0.336 | 0.434 | -0.105 | 0.024 | -0.179 | 0.196 | -0.006 | 0.221 |
| has_elevator | 0.491 | 0.343 | 0.008 | 0.085 | 0.270 | 0.389 | 1.000 | 0.004 | 0.017 | 0.219 | -0.058 | 0.019 | -0.038 | 0.065 | -0.001 | 0.083 |
| id | 0.021 | -0.029 | 0.039 | -0.001 | 0.010 | 0.014 | 0.004 | 1.000 | 0.028 | 0.022 | 0.020 | 0.069 | 0.001 | 0.044 | 0.018 | 0.030 |
| is_apartment | 0.138 | 0.127 | 0.000 | 0.236 | 0.150 | 0.336 | 0.017 | 0.028 | 1.000 | 0.020 | 0.032 | -0.025 | -0.024 | 0.047 | -0.012 | 0.007 |
| kitchen_area | 0.435 | -0.088 | 0.315 | 0.171 | 0.240 | 0.434 | 0.219 | 0.022 | 0.020 | 1.000 | -0.101 | 0.358 | -0.134 | 0.336 | 0.155 | 0.395 |
| latitude | -0.201 | -0.042 | -0.026 | -0.158 | -0.049 | -0.105 | -0.058 | 0.020 | 0.032 | -0.101 | 1.000 | 0.034 | -0.001 | 0.063 | 0.033 | -0.021 |
| living_area | -0.006 | -0.118 | 0.147 | 0.006 | 0.033 | 0.024 | 0.019 | 0.069 | -0.025 | 0.358 | 0.034 | 1.000 | -0.018 | 0.467 | 0.653 | 0.642 |
| longitude | -0.217 | 0.117 | -0.120 | -0.087 | -0.086 | -0.179 | -0.038 | 0.001 | -0.024 | -0.134 | -0.001 | -0.018 | 1.000 | -0.146 | -0.037 | -0.108 |
| price | 0.186 | -0.369 | 0.492 | 0.001 | 0.148 | 0.196 | 0.065 | 0.044 | 0.047 | 0.336 | 0.063 | 0.467 | -0.146 | 1.000 | 0.650 | 0.765 |
| rooms | 0.008 | -0.189 | 0.219 | -0.037 | 0.017 | -0.006 | -0.001 | 0.018 | -0.012 | 0.155 | 0.033 | 0.653 | -0.037 | 0.650 | 1.000 | 0.870 |
| total_area | 0.270 | -0.295 | 0.448 | 0.072 | 0.146 | 0.221 | 0.083 | 0.030 | 0.007 | 0.395 | -0.021 | 0.642 | -0.108 | 0.765 | 0.870 | 1.000 |
| id | build_year | building_type_int | latitude | longitude | ceiling_height | flats_count | floors_total | has_elevator | floor | kitchen_area | living_area | rooms | is_apartment | studio | total_area | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1965 | 6 | 55.717113 | 37.781120 | 2.64 | 84 | 12 | True | 9 | 9.90 | 19.900000 | 1 | False | False | 35.099998 | 9500000.0 |
| 1 | 2 | 2001 | 2 | 55.794849 | 37.608013 | 3.00 | 97 | 10 | True | 7 | 0.00 | 16.600000 | 1 | False | False | 43.000000 | 13500000.0 |
| 2 | 3 | 2000 | 4 | 55.740040 | 37.761742 | 2.70 | 80 | 10 | True | 9 | 9.00 | 32.000000 | 2 | False | False | 56.000000 | 13500000.0 |
| 3 | 4 | 2002 | 4 | 55.672016 | 37.570877 | 2.64 | 771 | 17 | True | 1 | 10.10 | 43.099998 | 3 | False | False | 76.000000 | 20000000.0 |
| 4 | 5 | 1971 | 1 | 55.808807 | 37.707306 | 2.60 | 208 | 9 | True | 3 | 3.00 | 14.000000 | 1 | False | False | 24.000000 | 5200000.0 |
| 5 | 6 | 2017 | 4 | 55.724728 | 37.743069 | 2.70 | 192 | 17 | True | 9 | 0.00 | 0.000000 | 2 | False | False | 51.009998 | 8490104.0 |
| 6 | 7 | 1964 | 4 | 55.795589 | 37.722622 | 2.64 | 180 | 5 | False | 1 | 6.18 | 29.340000 | 2 | False | False | 44.520000 | 9500000.0 |
| 7 | 8 | 2015 | 2 | 55.656345 | 37.424335 | 3.00 | 512 | 11 | True | 7 | 13.50 | 0.000000 | 1 | False | False | 52.000000 | 17990000.0 |
| 8 | 9 | 1982 | 4 | 55.574734 | 37.668686 | 2.64 | 127 | 16 | True | 7 | 8.18 | 19.100000 | 1 | False | False | 35.919998 | 6300000.0 |
| 9 | 10 | 1982 | 4 | 55.994698 | 37.196686 | 2.64 | 142 | 12 | True | 5 | 8.00 | 30.000000 | 2 | False | False | 50.000000 | 5900000.0 |
| id | build_year | building_type_int | latitude | longitude | ceiling_height | flats_count | floors_total | has_elevator | floor | kitchen_area | living_area | rooms | is_apartment | studio | total_area | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 141352 | 141353 | 1973 | 6 | 55.592419 | 37.604187 | 2.48 | 855 | 12 | True | 1 | 7.50 | 19.299999 | 1 | False | False | 34.900002 | 8150000.0 |
| 141353 | 141354 | 2004 | 4 | 55.853230 | 37.646873 | 2.74 | 513 | 17 | True | 9 | 7.00 | 18.000000 | 1 | False | False | 38.000000 | 9200000.0 |
| 141354 | 141355 | 1969 | 4 | 55.626068 | 37.608238 | 2.50 | 282 | 9 | True | 3 | 6.70 | 29.000000 | 2 | False | False | 45.000000 | 11300000.0 |
| 141355 | 141356 | 2008 | 4 | 55.872646 | 37.634228 | 2.74 | 128 | 17 | True | 9 | 12.90 | 33.900002 | 2 | False | False | 64.000000 | 14800000.0 |
| 141356 | 141357 | 1971 | 4 | 55.740402 | 37.834579 | 2.64 | 428 | 9 | True | 8 | 6.00 | 42.000000 | 3 | False | False | 64.000000 | 10800000.0 |
| 141357 | 141358 | 2013 | 4 | 55.626579 | 37.313503 | 2.64 | 672 | 25 | True | 16 | 11.00 | 18.000000 | 1 | False | False | 42.000000 | 10500000.0 |
| 141358 | 141359 | 1960 | 1 | 55.727470 | 37.768677 | 2.48 | 80 | 5 | False | 5 | 5.28 | 28.330000 | 2 | False | False | 41.110001 | 7400000.0 |
| 141359 | 141360 | 1966 | 4 | 55.704315 | 37.506584 | 2.64 | 72 | 9 | True | 7 | 5.30 | 20.000000 | 1 | False | False | 31.500000 | 9700000.0 |
| 141360 | 141361 | 2017 | 4 | 55.699863 | 37.939564 | 2.70 | 480 | 25 | True | 15 | 13.80 | 33.700001 | 2 | False | False | 65.300003 | 11750000.0 |
| 141361 | 141362 | 1988 | 4 | 55.862133 | 37.689613 | 2.74 | 128 | 17 | True | 16 | 7.60 | 18.000000 | 1 | False | False | 38.000000 | 8000000.0 |